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Annotation. 

We consider a class of two-dimensional functions f(x,y) with the property that the smallness 
of its rectangular norm implies the smallness of rectangular norm for f(x,x + y). Also we study a 
family of functions f(x,y) having a similar property for higher Gowers norms. The method based 
on a transference principle for a class of sums over special systems of linear equations. 



1. Introduction. 

The notion of Gowers norms was introduced in papers [2, 3] and is a very important tool 
of investigation in wide class of problems of additive combinatorics (see e.g. [2, 3] [8] — [20]) 
as well as in ergodic theory (see e.g. [15], [21] — [29]). Recall the definitions. 

Let G be a finite set, and iV = |G|. Let also d be a positive integer, and {0, l} d = 
{u = (u>i, . . . , u>d) : ujj G {0, 1}, j — 1, 2, . . . , d} be an ordinary d — dimensional cube. For 
uj G {0, l} d denote by \u\ the sum uj\ + • • • + ojd- Let also C be the operator of complex 
conjugation. Let x = (x\, . . . , Xd), x* = (x[, . . . , x' d ) be two arbitrary vectors from G d . By 
x^ = (xf, . . . , x^) denote the vector 

f Xi iiuji = 0, 



x 



x', if u):=\. 



Thus x^ depends on x and x'. 

Let X be a non-empty finite set, Z : X — > C be a function. Denote by ~EZ = K X Z the 
sum j^Y, x ex Z ( x )- 

Let / : G d — > C be an arbitrary function. We will write f(x) for f(x\, . . . , Xd). 

Definition 1.1 (see [2, 3]) Gowers U d -norm (or <i-uniformity norm) of the function / is 
the following expression 



w6{0,l} d 




l/2 d 
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A sequence of 2 d points x w , u G {0, l} d is called d -dimensional cube. Thus the summation 
in formula (1) is taken over all cubes of G d . For example, {(x, y), (x', y), (x, y'), (x', y')}, where 
x, x', y, y' G G is a two-dimensional cube. In the case Gowers norm is called rectangular norm. 

For d — 1 the expression above gives a semi-norm but for d > 2 Gowers norm is a norm. 
In particular, the triangle inequality holds 

\\f + 9\\u*< WfWui + hWu*- (2) 

One can prove also (see [3]) the following monotonicity relation. Let f Xd (x\, . . . ,Xd-i) '■— 
f(x 1 ,...,x d ). Then 

%c d eG\\fx d \\ud-i < \\f\\u* ( 3 ) 

for all d > 2. 

If G = (G, +) is a finite Abelian group with additive group operation +, N = |G| then 
one can "project" the norm above onto the group G and obtain the ordinary Gowers norm. 
In other words, we put the function f(x±, . . . ,xa) in formula (1) equals "one-dimensional" 
function f(xi,..., Xj) '■— f(xi + • — h Xa). Denoting the obtained norm as U k and we have an 
analog of (3) 

WfWu" < \\f\\u< (4) 

for all d > 2. 

Gowers norms have the following characteristic property. Let d > 2, x — (xi, . . . ,Xd) G 
G d be an arbitrary vector and i G {l,...,d}. By (x)(i) denote the vector (x)^ = 
(xi, . . . , Xi-i, x i+ i, . . . , Xd) G G d_1 . Applying the Cauchy-Schwartz (see [3] and [4, 5]) sev- 
eral times, we have 

Lemma 1.2 Let d > 2 be an integer, and f : G fc — > C be a function. Let also u±, . . . , : 
G d — > [—1,1] be any functions such that Ui(x) = i — 1, . . . , d, x — (x±, . . . , x^). 

Then 

x i=l 



< 



jjd 



Thus any function with small Gowers norm does not correlate with product of any functions 
which depend on smaller number of variables. 

In the paper we concentrate on the case of two-dimensional functions / :GxG^C,Gis 
a finite Abealian group. For any positive integer t one can consider U t+1 — norm of the function 

F t (x,yi,...,yt) ■= f(x,x + y!-\ Vy t ) and the function H t (x, y u y t ) := f{x,y 1 -\ \-y t ). 

It is easy to construct examples of functions f(x, y) with, say, huge rectangular norm and small 
quantity x + y)\\u 2 ("skew rectangular norm") and vice versa. Thus there is no obvious 

dependence between the numbers Hi^H^t+i and ||if t ||j7t+i in general. Nevertheless, we find a 
class of functions such that the smallness of ||i/ t ||;yt+i implies the smallness ||Ft||[/*+i. 

By G denote the Pontryagin dual of G. In other words G is the group of homomorphisms 
£ from G to R/Z, £ : x — > £ ■ x. It is well-known that in the case of Abelian group G the 
dual group G is isomorphic to G. 

Let us formulate one of the main results of the paper. 

Theorem 1.3 Let a : G — > G be a function and f(x,y) = e(x ■ a(y)), where e(x) = e 2mx . 
Then the condition 

\\f(x, yi + --- + y t )\\ ut+ i = o(l), N^oo (5) 
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implies 

+ + + =o(l), iV^oo (6) 

/or a// positive integers t. 

More precisely, we obtain a quantitative form of the formulas above (see Corollary 3.3). 
Note that more strong condition than (5), namely, \\f(x, y\ + • • • + yt+i)\\u t + 1 = trivially 
implies (6) (see proof of Corollary 3.3). Theorem 1.3 will be derived from the more general 
Theorem 2.4 of section 2. The result says, roughly speaking, that a class of sums of general 
form is taken over systems of linear equations including an arbitrary function a(x) can be 
reduced to sums having no these linear restrictions (see Theorem 2.4). It is interesting we do 
not use Fourier analysis in the proof. Our main tool is a suitable version of Lemma 9.3 from 
Gowers' paper [3]. 

Let us say a few words about the notation. If S C G is a set then we will write S(x) 
for the characteristic function. In other words S(x) — 1 if x G S and zero otherwise. By log 
denote logarithm base two and by D denote the unit disk on the complex plane. Sings <C and 
^> are usual Vinogradov's symbols. If n is a positive integer then we will write [n] for the 
segment {1,2,..., n}. 

The author is grateful to N.G. Moshchevitin for useful discussions. 

2. The proof of the main result. 

We need in a number of definitions. 

Let / > 2, m be any positive integers, m < I. Let also x±,...,xi, yi,...,yi be some 
variables. Consider a system of linear equations 

i 

J>J% = 0, i = l,2,...,m, (7) 
j'=i 

and also an equation 

i 

y £e? ) y j = 0. (8) 

Here ej" G {0, —1, 1}. So, if we know system (7) then we automatically know equation (8). 
Suppose that the rank of subsystem (7) equals m. 

Let S be the family of all systems from m equations of full rank having the form 
Yl l j=iVj x j = 0> Vj £ {Ojil} such that rj can be written as a linear combination of vectors 

\ . . . , s^), . . . , (e^, . . . , e[ m ^) from (7). For example, we can multiply some equations of 
(7) by —1 and get a system from S. Thus \S\ > 2 m . On the other hand, \S\ < 3 lm . Clearly, 
if a tuple (#!,..., xi) is a solution of some system from S then it is a solution of any system 
from S, in particular, system (7). By T denote system (7), (8). 

Let E be a family of linear equations with coefficient {0, ±1} which can be written as a 
linear combinations of some equations from S. Clearly, \E\ < 3'. Suppose that e G E and 
define 9(e) = 2*, where t is the number of zero coefficients in e. Put also 9(v) = Yleev ^( e )> v 
is a system. Lastly, let 

6 = 9 m _ 1 (E) = J20(v), 

V 

where summation is taken over all systems with m — 1 equations from E. 
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Let £>!,..., Bi C G be arbitrary sets. Suppose that there are some maps y?i, . . . , ipi, 

Finally, let C be a subset of G*. We will call a tuple (xi, . . . ,xi) satisfying a system 
from S additive. An additive tuple (xi,...,xi) is called degenerate if there is a non-zero 
vector ff = (rji, . . . ,r)i), rjj G {0, ±1} such that Y^j=iVj x j = an d the last equation does not 
belong E. Otherwise, such a tuple will be called non- degenerate. Note that non-degenerate 
additive tuples can satisfy equations from E only Further, non-degenerate additive tuple 
(xi, . . . ,xi) G C is called good if (xi, . . . , xi), (<pi(xi), . . . , <pi(xi)) satisfies (7), (8) for some 
system from S, and bad otherwise. So, by definition, good and bad tuples are non-degenerate 
and belong to C. 

First of all we prove the following simple extension of Lemma 9.3 from [3]. 

Lemma 2.1 Let a,ou,r) G (0,1] be any numbers, Bj C G be arbitrary sets, and C be a 
subset ofG 1 . Suppose thatT is a parameter, the number of additive tuples in (BiX. . .xBi) f]C 
is at most oj~ x T , 

T>2 2l \S\ [ +UJ M N i-m-i^ (9) 
V arj ) 

and there exist at least aT good tuples. Then there are some sets Bj C Bj such that the 
number of good tuples (x±, . . . , Xi) G (B[ x . . . x B[) f]C is at least 

0(7] \ 



2(1+0;-! 



and the ratio of good tuples in (B[ x . . . x B[) f]C to the number of bad tuples in 
(B[ x . . . x B[) f|C is at least (1 - 77). 

Proof. Let A; be a natural parameter and choose r 1 ,...,Tfc, si,...,Sk, 

. . . , . . . , w^ n ~ 1 \ . . . , Wfc™ -1 '' G G uniformly and independently. After that in- 
dependently choose points Xj G Bj such that a point Xj go into a new set Bj with 
probability 

j k m—1 

P( x i) = ^k II + cs ( r ^' + ' II ( X + cs K (<?);r i)) ' 

i=l g=l 

where cs (x) = cos(^x). A tuple (x±, . . . , xj) belongs to S| x . . . x with probability 

k 1 

Yl Yl [][](l + cs(r^ + ^)))x 



2mlkJ^(m+l)k 



ri,-,r k si,...,s fc (i) (i) ^.t™- 1 ) «=1 i=l 

m—1 



x n (i+csk ( S)) 



<?=! 

[ / ' m—1 

Tf^+iW Z Z 11(1 + cs (rarj- + sVjCxj-))) JJ^ + ^K^)) 



2mlk J\J (m+l)k 



r,s w 1 ,...,w m -i j=l 



1 f2- /m V V 

2mlkJ\f(m+l)k v / / / / 

r,s iui,...,UJ m _i 
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I m—1 

JJ(2 + e(rxj + s(pj{xj)) + e(-rxj - s(pj(xj))) JJ (2 + e(w ? ^) + e(-w q x j ))) k = 

j=l q=l 



(O-lm \- V TT V o\{i,3--e ^=0}| 

2mifej\r(m+l)A: 11 

r ' s wl '-"' Wm - 1 J= 1 e ( 1) ,..., ei (1) ,...4 m) v--,ei (m) e{o,±i} 

xe(r ^ e^xj + s ^ efVjfo) + Wl ^ eJ 2) Xj + • • • + w m -i E £j n) x j )) k ■ 
j j j j 

Thus, the last probability does not equal zero if there is a tuple (ef), ef e {0, ±1} such that 

E = E = E *T*> = ■ ■ ■ = E ^ = • ( 10 ) 

j j j j 

It is so, for example, if all equal zero. 

Let (xi, . . . ,xi) be a bad tuple. Then, by definition, (x±, . . . ,xi) satisfy a system v G S 
and is non-degenerate. It is easy to see that there is no vector (e^p, . . . ,e^) ^ such that 
(10) holds. Indeed, otherwise we can add some equations from v and obtain a contradiction 
with the fact that ( bad sequence. Thus, every bad additive tuple is chosen 

with probability 2- lmk (2- lm 2 l 9) k . Clearly, 9 > 2 i ( m ~ 1 ) and put 9 = 2 l{ - m ~ 1 ^ + 9 U 9 X > 0. Then 
the last probability equals 2- lmk (l + 9 1 2- l(m '^) k . 

On the other hand, it is easy to see that a good additive tuple is chosen with probability 
at least 

2 -imk^ 2 -im^ 2 i e + 2 0))fc > 2- lmk (l + 9 1 2~ l{m - 1) + 2^ 1] ) k > 2~ lmk (l + 1 2~ ,(m-1) )* . 

Note that there are at most 3 l \S\ . . . |.Bj_ ro _i| < 3 i |S'|iV / - m - 1 degenerate tuples. Now, 
let X and F be the numbers of good and bad additive tuples in B[ x . . . x B[. Using the 
assumption of the lemma, we get 

EX > 2- lmk (l + 9 1 2- l{m - 1) + 2- (l - 1) ) k aT 

and 

EF < 2~ lmk (l + 1 2- , ( m - 1 )) fc w _1 T . 
It is easy to see that 9 1 < 9 < 3 lm 2 lm . Since (1 + 2^ lm ) k > 2 k2 ' 3lm it follows that 

1 + , l2 -^-D J >(l + 2 3 -) fc >2- 

and the last expression is at least (1 + uj~ l )/(ari), provided by we choose an integer k from 
the conditions 



2(1 + LU- 1 ) J - + 

Hence 

7]EX-EY > «r72"' m/c (l+^ 1 2"' (m " 1) +2" ( ^ 1) ) fc T-2^ mfc (l+^ 1 2^ {m " 1) ) fe cj" 1 T > 2~ lmk T . (12) 
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Using (9), (11), we have 

2 -i m k T > 2 / aV ) T>2-A l \S\N l - m - 1 >2(3 , |S'||Si|...|S,_ m _i|). (13) 

\2(1 + LO ) J 

By (12), (13), we obtain that there are sets B[ C B±, . . . , B[ C Bi such that r/X > Y and 

X > ( 2(^-1) ) T.This completes the proof. 

Note 2.2 Certainly, one can suppose that our tuples (x±, . . . , xi), (ip(xi), . . . , <p(xi)), Xj G Bj 
satisfying (7) and (8) for some system v from S, satisfy a new system 

i 

2>J% = 0, i = l,2,...,m, (14) 
j'=i 

and 

i 

£efV(*i) = 0, (15) 
j'=i 

where £^ G {0, ±1}, £^ = £^ for any j > 1, i > 2, the rank of system (14) equals m, and 
e^p depend on v. 

Definition 2.3 Let L, I be positive integers. A set f2 C G' is called a set of level L if 

n = - n?) 

where each fi", Q!- C is a Cartesian product of some sets. We say that Cartesian 
products are sets of level zero. 

Let C be a set, and let / be a complex function which depends on variables (x±, . . . , x{) G 
(£>! x ... x B{) H C, x = (xi,...,xi) and / also depends on ip\{x\), . . . , <pi(xi), (p(x) = 
((pi(xi), . . . , ipi(xi)). Let v be a system from S and 

a f:V (B 1 ,...,B l ;C) := /(xi, . . . , x h (^i(xi), . . . , <pi(xi)) . (16) 

(x,<p(x)) satisfies (7), (8) for ves 

Further o~f(Bi, . . . , Bf,C) = ^2 veS \o~f jV (Bi, . . . , Bi;C)\. Take an arbitrary number p G [I], 
express x p from (7), (p p (x p ) from (8) and substitute x p , (p p (x p ) into /. After that choose 
another m — 1 variables x^, . . . , x J - m _ 1 express them from (7) and substitute into /. Let 
j = (ji, . . .,j m -i). We get a new function 

f p j(%) ■= f p j( x j, Vj&i)) > P ^ O'i. • • • . Jm-l} • 
Let ||/|| W d = iV^H/H^d and for any p 1 , p 2 G [d], pi < p 2 , we put 

II f\\u 2 (pi,p2) = ^ ^ ^ ^ f( x li ■ ■ ■ i x pi ■>•••■> x P2i ■ ■ ■ i %d) f( x l, ■ ■ ■ i %' pi i ■ ■ ■ i %p2 i ■ ■ ■ i x d) ^ 



C P1 > X P2 



* / ( x 1 5 • • • 5 "Epi 5 • • • 1 x P2 i ■ ■ ■ i X d) f ( X 1 1 ■ • ■ 1 X p\ i • • • 1 X p2 i • • • i X d) 
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Formulate the main result of the section. 

Theorem 2.4 Let e G (0, 1] be a real number, e < 2~ 2 1 m 2 m , and f be a complex 
function, f : G — > D. Let also B ± , . . . ,B t be arbitrary sets. Suppose that for any v G S there 
are j, p G [/] and p\ ^ p, p 2 ^ p, P\ ^ P2 such that 



ii/;#)ibw)<^' • (17) 

Then 

( / , 19 v (128«m2 3 - i )- 1 / 9 2i\ (16/m2 3m ')- 1, j 

"<* * G, ) £ —{ 230 (i^7i)) - 4 u) r N " m - (18) 

Proof. Let £i be the maximum in the right hand side of (18) divided by N l ~ m . Let also 
v E S and denote by tj jV (Bi, . . . , Bf,C) the number of solutions of system (14), (15). By 
Tf(Bi, . . . , Bf, C) denote the sum J2 v es r f,v(Bi, . . . ,Bf,C). Suppose that inequality (18) does 
not hold. Since r f (B u . . . , B x \ G l ) < \S\N l ~ m and 

a f (B 1 ,...,B l ;G l )<r f (B 1 ,...,B l -G l ), (19) 

it follows that (T f (B 1 , ...,B t ; G l ) > e 1 \S\~ 1 r f (B 1 , ...,Bf, G l ) and T f (B 1 , ...,Bf, G l ) > 

We need in a "density increment" lemma. 

Lemma 2.5 Let a,u G (0, 1] be any numbers, Bj be arbitrary sets, and C be a set of level 
L. Suppose that 

r f (B 1: ...,B l ;C)> \S\ max{2 2 ' ( ^±±^M) ^ N l ~ m ~ l ,uN l ~ m } , (20) 

V a£ i J 



< o-4L-i6|ci-8 r 4/' Q^i A " f r f (B 1 ,...,B l ;C) \ 

and f : G — > D is a function satisfying (17). Let also 

a f (B u ...,Bi;C)> ar f (B 1 , B t ; C) . (22) 

Then there is a set C C C o/ /ewe/ at most L + 1 and there are sets Bj, j G [I], (B[ x . . . x 
B[) H C = snc/z £/ia£ 

^(Si, ...,£*,;£)< ^(Sx, . . . , Bf, C) + 2- 2 £l |5|- 1 r / ( J B;, . . . , B[; C) . (23) 

and 

r s {B u ..., B i; C) < (1 - 07/(5!, . . . , S,; C) , (24) 

/ \ Im2 3ml 

w/iere C = ( 3 2(l"-i)|5| j 

Proof. Using Lemma 2.1 with parameters a,u,r] = |S'|~ 1 £ 1 /16 and T = Tf(B 1 , . . . , Bf, C), 
we get the sets Bj C Bj, j G [/] such that the number g of good tuples (xi,...,x{) G 
(B[ x . . . x f| C is at least 

/ \ lm2' iml 

g > ( 2(1 t /( s i> ■ ■ ■ > ^ c ) = ^ 5i > • • • > ^ °) • ( 2s ) 



Thus 

r f (B[, ...,B' l ;C)>g> tr f {B u ...,B t ;C). (26) 
Let (Bj) 1 = B'- and {B\f = Bj \ B'-. Then for any v G S, we have 

a ftV {B u ...,BfiC)= £ VfA(BT\...,(B' i r;C)= £ a ftV (w;C) , 

^£{0,1}' ^€{0,1}' 

— * 

where w = (wi, . . . , w{). Without losing of generality, assume that p = I and j = 
(xi- m+ i, . . . , Xi-i). Consider the term a\(v) := af >v (B' 1 , . . . , B[; C) which corresponds to 
W\ = ■ ■ ■ = w\ = 1. We have 



£M«)I<£ 

ties' ves 



xe(B[x...xB' l )f}c,(x,<p(x)) satisfies (7), (8) for v 



+ 



+r ] r f (B[,...,B' l ;C) = a[ + a';. 
It is easy to see, using (17) and Lemma 1.2 that 

a[ < 2 i | 1 S|e 1/4 iV'- m < 2 L 3 lm e 1/i N l ~ 



(27) 



(28) 



Put C = ((Si x...xB,)\(B;x...x B[)){\C. Clearly, C is a set of level L + 1. Using 
inequalities (27), (28), we obtain 



(29) 
(30) 



a f (B u ...,B r ,C) = J2 WfA B i, ...,B l ;C)\<a' 1 + a l l + a f (B u ...,B r ,C)< 

ves 

< 2 L 3 lm e 1 / A N l ~ m + V r f (B[, B[- C) + a f (B u ...,B t ;C). 
Recalling that rj = |S'| _1 £i/16, using condition (21) and formulas (26), (29), we get 

a f (B u . . . , B f , C) < a f {B u . . . , Bu C) + 2- 2 e 1 \S\- 1 r f (B[, B\\ C) . 

Finally, 

T f (B u ..., Bu C) + r f (B[,..., B\- C)=T f (B u ..., B t ; C) 

and by (25), we obtain (24). This concludes the proof of the lemma. 

Now return to the proof of the theorem. We use Lemma 2.5 inductively. At zeroth step, 
we have a > |S'| _1 £i and oo is any positive number such that oo < E\\S\~ l . Suppose that our 
algorithm was applied h times. Thus we obtain the sets Cj, j G [h], every Cj is a set of level j 
and two sequences of d, . . . , (h, ■ ■ ■ , uih, each Q depends on ojj. Using inequality (23), we 
see that for any j the following holds 

a f {B u ... , Bu G l ) <a f {B u ..., Bu Cj) + 2- 2 e 1 \S\- 1 r f (B 1 , ...,B t ; G l ) < 

<a f (B 1 ,...,BuC :i ) + 2- 2 e 1 N l - m . 

So, we can assume that for all j G [h], we have <jf(B 1 ,...,Bi;Cj) > 
e 1 (2\S\y 1 Tf(B 1 , . . . , Bf, Cj). Further, in view of (19), we can suppose that for all j G [h] 
the following holds Tf(B 1 , . . . , Bf,Cj) > 2~ 1 e 1 N l ~ m . Thus for any j G [h], we can take 
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3 mi 



/ 2 \ ImZ" 

:= 2" 1 £ 1 |S'|" 1 and hence for all j G [h] the following holds Q > ) = C*- 

By (24), we get 

r f (B 1: ...,Bi; Cj) < (1 - C*) J T/(5i, •••>£»; G') < (1 - C) J |5|^- m 
and our algorithm must stop after at most 2 4 log(2|<S'|£ 1 ) ( . ... I2 ) := h number 

\04:{1+2£ 1 \b\)\b\ / 

of steps. Clearly, the maximal level of any set which appears in our induction procedure does 
not exceed h. Using the inequality Tf(Bi, . . . , Bi;Ch) > 2~ 1 eiN l ~ m , we get from (21) the 
dependence between parameters e and S\ 



g < ^1^^ ( , (31) 



2 \ 4Zm2 



Besides, by (20), we obtain 



^^64(1 + 2^1)1^, ^ (32) 



£ 



1 

If (31) and (32) hold then we have a contradiction. Small calculation shows that the last two 
inequalities imply (18). This completes the proof of Theorem 2.4. 

Note 2.6 Probably, the result above suggests that we have a phenomenon in spirit of the 
well-known sum-product phenomenon (see e.g. [6] or [7]) or the dichotomy phenomenon 
(see e.g. [14]). Indeed, Theorem 2.4 is trivial in two opposite cases : if a(x) is a linear 
function then <jf(Bi, . . . , Bf, G) is small by (17) and if a(x) is far from all linear functions (e.g. 
a(x) = x 2 ) then (18) is small by (19) and an appropriate upper bound for tj(Bi, . . . , Bf, G). 
So, in some sense our result can be interpreted that there is no function linear and non-linear 
simultaneously. 

3. Applications. 

First of all let us note a simple property of Gowers norms (see e.g. [3]). 

Lemma 3.1 Let d > 2 be an integer, and f : G k — > C be a function. Let also u±, . . . ,Ud : 
G d — > [—1,1] be any functions such that Ui(x) = Ui((x)^)), i = l,...,d, x = (x±, . . . , x^). 
Then 

d 

ii/(*)e(n^))ii^ = wmwu*- 

i=i 

Also, we need in a convexity lemma. 

Lemma 3.2 Let k > and h(x) = l/(\ogx) K . Then for any real numbers xi, . . . , xn > 1, 
we have 



i=i \ i=i / 



N 

Now we obtain a corollary of Theorem 2.4. Clearly, the result below implies Theorem 1.3. 
Corollary 3.3 Let e G (0, 1) be a real number, a : G — > G be a function, and f(x,y) = 
e(x ■ a(y)). Suppose that 

\\f(x,y)\\ u2 <e. (33) 
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||/(x,, + !/ )||„ 1 <3^„ax|2»( i3i | M ) 2 ,4g) 2 JjV*. (34) 
If t is a positive integer and 

\\f(x, yi + --- + y t )\\ ut+ i<e (35) 

then 

\\f( x , x + y 1 + ... + y t ) \\ ut+1 « t max [ lQ J 1/£ y W _c(t) } ^ > ( 36 ) 

where c(t) > some constant depends on t only. 

Note 3.4 Clearly, formula (34) of the corollary above holds for any function f\(x,y) = 
e((x — A) • a(y)), where A G G is an arbitrary element. 

Proof. For w = (wi, . . . , wa), d E N and cu e {0, l} d we write for Yli=i w f- Any 
sequence of points (iUu;)u;e{o,i} d £ G is called d-dimensional cube (see [3]). So, we numerate 
the points of an arbitrary rf-dimensional cube by index u e {0, l} d . 

First of all let us note that the dimension of all d — dimensional cubes equals d + 1 and 
any d — dimensional cube (zi, . . . , z 2 d) is a solution of a system of full rank having Ef=o if) = 
2 d — d — 1 linear equations, say 

E. e{0 ,i } ^-(-i) M = o, 

£. e{ o,i}<^-(- 1 ) M = ' Ui = Me[d\- 
E we{ o,i}d ^ • (-l)M = , u h = --- = uj id ^ 2 = 0, ij E [d] . 

By S denote the last system. The fact that S has rank equals 2 d — d — 1 can be obtained 
by successfully considering of its equations from the first to the last (see variables 
Zo,i,...,i, ■ ■ ■ ,Zi,..,i,o and so on). 
Our task is count the quantity 

(r t :=\\f{x,x + yi + --- + y t )\\ut+i = Y^ Y Y e ( Y x ^ ' a ^ + f(«a.-,«t+i))) ' 

x,x' yi,-,y t y[,...,y' t ^e{0,l} t+1 

where y = (yi, . . . ,y t ). Let us change the variables x —> x — yi, x' —> x' — y±, y[ — yi — A. Let 
z = (x, A,y 2 ,...,y t ),ue {0, l} m , w = (u>i, 77, w 2 ), where Q lt r) e {0, 1}, u 2 e {0, l}*" 1 . Put 
— ^wi + ??A + y^. So, z w depends on x, A, y. We write r] = ^{z^). Clearly, 

a t = Y Y e ( Y a ( Xa ^ + ^ A + )) x 

x,x',A y 2 ,...,yt,y' 2 ,...,y' t wG{0,l} t+1 

xN5 ( Y (-l) H «0O), (37) 
^e{o,i}' +1 

where <5o(x) = 1 if x = and zero otherwise. It is easy to see that the sequence (^) a ;e{o,i}'+ 1 
form a cube. So, our sum o~ t has form (16). To prove Corollary 3.3 we need to check condition 
(17) but firstly let us prove the corollary in the simplest case t — 1. In the situation formula 
(37) is 

<Ti = Y e(xa(x)-x'a(x')-xa(x + A)+x'a(x' + A))-N5 (a(x)-a(x')-a(x + A) + a(x' + A)) 

x,x',A 
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= E e (( x - x')(a(x) - a(x + A))) • N5 (a(x) - a(x') - a(x + A) + a(x' + A)) 

x,x',A 

and condition (17) holds because (33) and S contains just two equations in the case. Thus, 
we prove (34), provided by t = 1. 

Now suppose that t > 2. Let Ylu, a i z u)Vu = be an equation from S. Suppose that there is 
fj u 7^ such that 77(2^) = 1 (see definition of zj). Without losing of generality, one can suppose 
that fj^ = —1, where 1 = (1, . . . , 1). Then a(z^) = ^ w a{zj)f} u and we can substitute it 
into formula (37). After that change the variables x — > x — y 2 , x' — > x' — y 2 , y' 2 — 2/2 = Q- 
Consider the variables x', q, y' 3 , . . . , y' t . We see there are terms e(x'a(x + q + y 3 + • • • + y' t )), 
e(x'a(x + A + q + y' 3 + • • • + y' t )) and may be the term e((x' — y 2 )a(x f + q + y' 3 + ■ ■ ■ + y' t )), 
containing x', q, y' 3 , . . . , y' t but there is no term e((x' — y 2 )a(x' + A + q + y' 3 + • — h y' t )) in the 
expression for a t . Put 

®ylxA( x> , 9, 2/3, • • • , y't) = + q + V3 + --- + y't)+ x'a(x + A + q + y' 3 + ■ ■ ■ + y' t )) (38) 

and 

©S,,a(^, Q, Vs, ■ ■ ■ , 2/0 = ©^,a(^ ? 9, 2/3, • • • , V't) ■ e((x' - y 2 )a(x' + q + y' 3 + ... + y' t )) (39) 

provided by the term e({x' — y 2 )a(x' + q + y' 3 + • • • + y' t )) exists. Clearly, another multiples do 
not contain these variables. Using Lemma 3.1 and summation over y 2 , we get 

E K?,^''^ ■■■,y't)\W< N 2 \\f(x, yi + ■■■ + y t )\\ ut+1 . 

Similarly, summing over x, A, we have 

E HZ^' 9» 2/3, • • • , 2/0 llw < ^ || / (x, yi + • • • + y t ) \\ u ^ ■ 

By Lemma 3.2, we obtain (36). Now suppose that for our equation a(z u )fj u = from S 
for any fj^ 7^ we have ^(z^) = 0. But changing the variables y 2 ^ y 2 — A, y' 2 ^ y' 2 — A we 
reducing our expression to the previous case. This concludes the proof. 

Note 3.5 If we take a(x) = const then, clearly, (34) does not hold. On the other hand it is 
easy to see that (33) is also has no place. If we put a(x) = x then (36) and (35) do not hold 
for t = 2. For t > 2 take a(x) = x l ~ x . Then the norm of the functions from (35) and (36) 
equal one. Thus, in some sense our conditions (33), (35) are necessary. 

Note 3.6 Let us analyze a little bit more general functions of the form f(zi + --- + z r + 
Xi + ■ ■ ■ + x s , x\ + • • ■ + x s + yi + ■ ■ ■ + y t ), r is a positive integer, s, t > 0. If r > 2 then 
||/ 1| jjs+t+r = 1 for any function a(x). Further, if r = 0, t — and G = Z p , p is a prime number 
then put a(x) = x^ 1 . It is easy to see that inequality (33) takes place (e.g. see Corollary 
3.7) and \\f\\u s = 1- The case r = s = is not interesting and consider the situation r = 0, 
s > 2, t > 1. It is easy to see that ||e(xa(yi + ••• + y s +t)\\u s + t + 1 — trivially implies 
+ - • - + x s ,Xi + - ■ - + x s + yi + • ■ ■ + yt)\\u s + t — o(l) (see the first step of proof of Corollary 
3.3) and for a(x) = x s+t ~ 2 , we have ||/(xi + • • • + x s , x± + ■ ■ ■ + x s + y\ + ■ ■ ■ + yt)\\u s+t = 1- 
Finally, in the case r = 1, s,t > 1 we can apply the same argument. Thus, the choice r = 0, 
s — 1, t > 1 is the only situation when we can obtain non-trivial upper bounds (under some 
analogs of assumption (35)) for f{z\ + • — h z r + X\ + • • • + x s , x± + • — h x s + y\ + • — V y t ). 
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Corollary 3.7 Let K be a positive integer, a : G — > G be a function, and f(x,y) = 
e(x ■ a(y)). Suppose that 

\{yeG : a(y)=j}\<K, jeG. (40) 

Then 

\\f(x, x + y)\\ u > <3 4 max ( 2 3 ° ( T7r ^ 7rT ) ,4( ^ ] }> iV 4 

w/iere e„ = (K/N) 1 ^. 
Proof. It is easy to see that 



io g (i/o; ' \n 



\\f^y)\\m = ^i E e((x-ar')(a(y)-a(i/))) = ^X; 

x,x',y,y' jeG 



where Mj = {y G G : a(y) = j}. By assumption, we have |Mj| < X for all j G G. Besides 
\Mj\ = N. Hence \\f(x, y)\\u 2 < (K/N) 1 ^ 4 and the statement follows from Corollary 3.3. 

This completes the proof. 

Thus, if, say, K = N 1 '", N -> 00, re > then a: + y)\\ V 2 « l/(logiV) c , where 

c = c(re). Even in the situation when preimage of any (or almost any) point j G G has the 

cardinality o(N) we have a non-trivial bound for \\f(x, x + y)||c/ 2 - 
Note 3.8 It is easy to see that 

\\e(xa(y 1 + --- + y t ))\\ ut+ i = N 2 .#{y 1 ,...,y t ,y' 1 ,...,y' t : E a(j/ w )(-l) M = 0} . 

we{o,i}' 

So, condition (35) can be interpreted as the requirement for a(x) to be far from "polynomial 
of degree t — 1" (or (t — l)-step nilsequence, more precisely, see e.g. [9, 24, 27]). 

Finally, we apply Corollary 3.7 to a family of subsets of G x G, G = Zjy. Let us recall 
some formulas from Fourier analysis. Let / : G — > C be a function. By /(£) denote the 
Fourier transformation of / 

/(0 = E /(x)e(-e • x) , (41) 



We have 



and 



E l/^)! 2 = ^ E l/(0l 2 ( p ™al identity) (42) 
/(x) = E /(0 e (£ ' x) (the inverse formula) . (43) 



N 

?6G 



If 



then 



(/ * 0)0*0 := E f(v)9( x - v) 

yeG 



f*9 = f9- (44) 

Now we can formulate our corollary. 
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Corollary 3.9 Let G = Z^, N be a prime number. Let also a : G — > G be a function, 
a(y) 7^ for all y e G. Suppose that P is an arithmetic progression, \P\ > 2 5 N 3 / 4 , f(x,y) = 
P(x ■ a{y)) - \P\/N, and inequality (40) holds with K < 2~ 18 |P| 4 /iV 3 . Then 

ii^+*^ 3W { 2M (i^) 2 ' 4 © 2 "} Ni - (45) 

where e* = (4K/N) 1 / 48 . 

Proof. To reduce some logarithms in our bounds we use a well-known trick. Without 
losing of generality, suppose that P = {0, 1, . . . , \P\ — 1}. Let P 1 = {0, 1, — 1}, t = 
[|P|2/3^-i/i2 Jv i/4 2 ] > and W ( x ) = |p 1 |-i(p*p 1 )(a;). Clearly, < < 1, E,^) = 1^1 

and W(x) = P(x) for all but at most 2t points x G G. By Parseval, formula (44) and the 
Cauchy-Schwartz, we obtain 

Y\ \W(r)\ < ^yfNlPTlyfNlPl = N^'h- 1 ' 2 . (46) 

r 

By the triangle inequality for U 2 — norm and (43), we get 

||/||^< ||^-|P|/iV|| w2 + 32-tiV 3 = 

= S W(ri) . . .W(r A ) ^2 e(r 1 xa(y))e( y -r 2 x'a(y))e(-r 3 xa(y'))e(r 4 x'a(y')) + 

(n,...,r 4 )^0 x,x',y,y' 

+32 • tN 3 . (47) 

Clearly, ||e(r-xa(y))|| [ /2 = || e(xa(y))\\ V 2, provided by r ^ 0. We have ||e(xa(y))||j72 < (K/N) 1 ^ 
(see the proof of Corollary 3.7). Using the last inequality, Lemma 1.2, formulas (46), (47), we 
obtain 

11/11* < (S) 2 (|) V4 iV 3 + 32 ■ tiV 3 < 2 8 (|) 1/12 iV 4 . 

Using Corollary 3.3, we get (45). This concludes the proof. 

iVote 3.10 There is another alternative way to obtain the smallness of || • \\ V 2 — norm of 
the function f(x,y) = P(x ■ a(y)) - \P\/N = P(x ■ y) - \P\/N in the situation G = Z N 
and a(y) = y. It is easy to see, using multiplicative characters, that the ||/||c/ 2 is small by 
Polya — Vinogradov inequality (see e.g. [1]). 
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